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Communicated by R. B. Goldschmidt, January 26, 1948 


At the beginning of the prepupal stage, the salivary gland of Drosophila 
becomes enormously inflated. This inflation is due to the appearance of a 
large amount of material in the lumen of the gland. Prior to the prepupal 
stage this material is in the cytoplasm of ‘the gland cells but not in the 
lumen of the gland. It is undoubtedly the secretion released from the cells 
into the lumen soon after the larval stage terminates. The origin and 
movements of this matesial will be described in detail elsewhere. 

If an inflated gland is isolated from the prepupa and its tissue is broken 
in water, the secretion flows out, and unless the water is distinctly acid it 
dissolves quickly, forming a viscous solution. If, on the other hand, an in- 
flated gland isolated from a prepupa is placed in 95% alcohol, the secretion 
immediately hardens and becomes opaque while the tissue remains trans- 
parent. The secretion can be easily isolated in this condition by removing 
the tissue with needles under the dissecting microscope. The isolated 
secretion retains the same solubility in water as before alcohol treatment 
provided it is not kept too Jong in alcohol. If an isolated secretion is trans- 
ferred to absolute alcohol, it immediately becomes brittle and loses its 
solubility in water except at high pH’s. In its native state, i.e., before 
alcohol treatment, most of this material does not dissolve in common lipid 
solvents. 

By separating the inflated glands from about 800 prepupae of Drosophila 
melanogaster in isotonic saline and isolating the secretions as completely as 
possible from the tissues in 95% alcohol, about 1600 pieces of secretion were 
collected. These were transferred to ether as soon as isolated from the 
tissues and exhaustively extracted, and the residue was thoroughly dried. 
From the weight of 600 pieces of the residue, the average weight of one 
piece of secretion was found to be approximately 4 ug. The ether-soluble 
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fraction of the secretion was found to be very small in proportion to the 
residue. Its chemical nature will be described elsewhere and only that of 
the residue will be reported here. 

The residue is insoluble in all common organic solvents and even in dilute 
or concentrated mineral acids unless heated, but it dissolves readily in weak 
or strong alkali solutions without heating. A sample of residue was dis- 
solved in dilute NaOH solution and the solution was neutralized with HCl 
and tested for various color reactions with the following results: positive for 
ninhydrin and biuret reactions; slightly positive for Molisch reaction; and 
negative for Millon’s, Hopkins-Cole and xanthoproteic reactions. These 
results suggested that the substance is a protein containing an unusually 
small amount, if any, of aromatic aminoacids. The ultra-violet absorption 
spectra showed no selective absorption between 210 uw and 310 uw. The 
absence of selective absorption at about 290 wu agrees with the negative color 
reactions for aromatic amino acids. The absence of selective absorption at 
260 u indicates absence of nucleic acids in the residue. These tests were re- 
peated with several samples with consistently the same results. The resi- 
due is, therefore, not significantly contaminated with the gland tissue, for 
otherwise the nucleic acid would be detected. In fact, microscopic exam- 
inations revealed that the number of cells present in the residue was so ex- 
tremely small that they can be entirely. neglected. It may be added that 
the dissolved residue is precipitated by phosphotungstic acid but not by 
trichloroacetic acid. : 5 

The nitrogen content of the substance was found by micro-Kjeldahl analy- 
sis to be 10.8%. This figure was obtained from one sample of 4.8 mg. and it 
is unusually low for a protein. When samples of the residue were hydrolyzed 
with 10% HCl for 24 hours with constant boiling of the acid and when the 
acid was evaporated on a steam bath, a substantial amount of crystalline 
inorganic salt was found in the dry residue. Although the proportion of the 
salt in the samples was not determined, it seems that the unusually low 
figure for the N content found in the above analysis can well be aceounted 
for by the presence of the inorganic salt, and a large amount of glutamic acid 
and some glucosamine, both of which have low nitrogen content (see below). 

As a preliminary to the two-dimensional chromatogram analysis, three 
samples of small quantities of the residue were hydrolyzed with 10% HCl 

for 24 hours with constant boiling of the acid and the hydrolyzates were 
analyzed on filter paper strips, one with collidine and two with phenol as the 
solvent. For the principle and technique of the partition chromatogram 
method of protein analysis, the reader is referred to Consden, Gordon 
and Martin (1944)! and to Dent (1946)”. In one of the phenol strips an in- 
,tensely colored band was present at Rf = 0.21. In the control strip in 
which glutamic acid was used, the Rf of the band was 0.20. In the other 
phenol strip a strongly colored band was also present at about fhe same 
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position. In the collidine strip also an intensely colored band was present 
at Rf = 0.12. In the control strip in which glutamic acid alone was used, 
the Rf of the band was 0.15. These bands in the experimental and control 
strips are sufficiently close to each other to indicate that the band in the ex- 
perimental strip represents glutamic acid. Less intensely colored bands 
were found at various other positions on all three strips. 





FIGURE 1 


The result of a two-dimensional chromatogram analysis of the acid hydrolyzate of 
the protein in the secretion of the salivary gland in Drosophila is shown. Each circle 
represents one amino acid or an oxidation product of an amino acid. The hydrolyzate 
was put at the position of the cross at the lower right and phenol and collidine 
were run in the respective directions indicated by the arrows. A, cysteic acid; B, 
aspartic acid; C, glutamic acid; D, serine; E, glycine; F, threonine; G, alanine; 
H, tyrosine; I, glucosamine; J, methionine sulfoxide; K, valine; L, lysine; M, arginine; 
N, proline; O, leucine; P, phenylalanine. 


Since most of the bands on the strips represented mixtures of two or more 
amino acids, a two-dimensional analysis was made to separate all the amino 
acids contained in the residue in order to identify them individually. 


Eight hundred yg. of the residue was taken for this purpose. This is about 


ten times as much as the amount required for an adequate analysis of an 
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ordinary protein by the chromatogram method. ‘The residue was hydro- 
lyzed with 10% HCl for 24 hours in a sealed glass tube placed in boiling 
water and the hydrolyzate was analyzed on a square filter paper with phenol 
and collidine as solvents. The result is shown in figure 1. Sixteen ninhy- 
drin-positive spots are recognized, fifteen of which are identified as amino 
acids. The color of these spots is, on the original paper, only about half 
as intense as would be expected if the sample consisted entirely of a protein 
of usual N content. This indicates the presence in the residue of a consider- 
able amount of ninhydrin-negative substance. The presence of a large 
quantity of inorganic salt in the residue could account also for this result. 


One spot (J) is at a position which no known amino acid occupies. 
This spot corresponds exactly to that of glucosamine. A very weak, positive 
Molisch reaction indicates the presence of a small quantity of carbohydrate 
in the residue and supports the chromatogram analysis for the presence of 
glucosamine. 


The central fading of the aspartic acid (B), glutamic acid (C), glycine 
(EZ) and arginine (M) spots denotes very great strengths of color of these 
spots. From the areas and color intensities glutamic acid appears to be 
the largest in proportion and threonine (F) next to it. Phenylalanine (P) 
and tyrosine (/) are of negligible proportion as indicated by the extreme 
faintness of the spots. Tryptophane is absent on the paper, These facts 
agree well with the absence of selective absorption at 290 u and the negative 
aromatic amino acid color reactions. Methionine sulfoxide (/) and cysteic 
acid (A) seem to be artefacts produced by oxidation of the protein and these 
are probably parts of methionine and cystine, respectively, in the original 
protéin. 


The presence of glutamic acid in the largest proportion in this protein is a 
noteworthy characteristic, but not unusual as a number of proteins are 
known to contain a disproportionately large quantity of this amino acid. 
The unique feature, however, is the unusually large proportion of threonine- 
(F) and serine (D). Dr. Dent informs me that he knows of no other protein 
so far analyzed by the chromatogram method which shows such an intensely 
colored and broad spot of threonine in relation to the other spots. Al- 
though an accurate quantitative determination is not possible, the fact that 
this spot is so unusually out of proportion indicates an extraordinarily high 
content of threonine in the protein of the residue. The serine spot is like- 
wise unusually high in intensity in proportion to the other spots, indicating 
‘ also an extraordinarily large content of this amino acid in the protein. 


As pointed out already, this protein together with the ether-soluble sub- 
stance is secreted into the lumen of the gland as soon as the animal begins to 
pupate. The secretion remains in the lumen for a short time and disap- 
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pears. Where it goes and what it is used for is now being investigated. 
Although no evidence has yet been obtained, the fact that it disappears from 
the gland at an early pupal stage suggests a possibility that it either enters 
into the formation of a part of the pupal body or participates in the chemical 
processes of histogenesis. One possible function it may have would 
be to provide through digestion a very large amount of glutamine. Glu- 
tamine readily enters into transamination reactions and its presence thus 
would provide an ample source of a variety of amino acids which must be 
required for the rapid rate of protein synthesis occurring at this stage of 
development. Similarly, serine which is present in an unusually large pro- 
portion in the secretion protein may, after being freed from the protein, be- 
come an abundant source of precursors of other amino acids such as glycine 
and cystine (Stetten, 1942). As to threonine, which is contained also in an 
extraordinarily large amount in this protein, its possible use in histogene- 
sis cannot be surmised, since only relatively little is known concerning the 
metabolism of this amino acid. 

I am gratefully indebted to Professor Alexander Dounce for his invaluable 
suggestions, advice and criticism and to Dr. C. E. Dent for the chromato- 
gram analyses. This investigation was undertaken in Professor Curt 
Stern’s former laboratory at the University of Rochester. My hearty 
thanks are due to Professor Stern for advice and suggestions throughout 
this work. 


1 Consden, R., Gordon, A. H., and Martin, A. J. P., Biochem. J., 38, 224 (1944). 
2 Dent, C. E., The Lancet, Nov. 2, 1946. 
3 Stetten, A., J. Biol. Chem., 144, 501 (1942). 


LINKAGE STUDIES OF THE RAT. IX. CATARACT 


By W. E. CASTLE AND HELEN DEAN KING 
DIVISION OF GENETICS, UNIVERSITY OF CALIFORNIA, AND WISTAR INSTITUTE 


Communicated February 20, 1948 


Cataract is a dominant character discovered in an albino strain of rats 
by Smith and Barrantine, and described by them in the Journal of Heredity, 
34, 8-10, 1943. A stock of cataractous individuals, 3 females and 2 males, 
was kindly supplied to us for linkage studies in September, 1943. 

No indications have been found of linkage between the gene for cataract 
and any other gene of the rat. The tests made and their results are sum- 
marized in table 1. The progress of the tests has been slow because the 
cataract is recognizable only in individuals which have unpigmented eyes 
albinos or pink-eyed yellows. Colored individuals must be subjected to 














136 GENETICS: CASTLE AND KING 


Proc. N. A. S. 


a breeding test in order to ascertain whether or not they are carriers of 
the gene for cataract. 

The test procedure is as follows. A cross is made between a cataractous 
individual and an individual carrying the other gene to be tested, as for 
example, Cataract X Curly. An F, individual will carry Ca and Cu in 
the repulsion relationship. 

Such an F; animal, if crossed to normal albinos, will produce four classes 
of offspring, of which the first and fourth will be crossover combinations, 
Ca Cu, Ca, Cu, and 0 (neither Ca nor Cu). If the sum of the crossover 
classes is significantly less than that of the other two classes, linkage will 
be indicated. In no cross made has such an indication been found. See 
table 1. 

We conclude that the gene for cataract does not lie in any of the four 
known linkage systems, nor in chromosomes tagged by the genes, Cw, 
j,d,Aandh. This indicates that it will serve as marker of a tenth chromo- 
some. 


TABLE 1 


TEsts FOR LINKAGE BETWEEN THE DOMINANT GENE CATARACT (Ca) AND OTHER 

GENES OF THE RAT. KNOWN LINKAGE SYSTEMS INVOLVED IN THE TESTS ARE IN- 

DICATED BY ROMAN NUMERALS. FREQUENCIES OF CROSSOVER CLASSES ARE SHOWN 
IN ITALIC NUMERALS 


TOTAL TOTAL 





7-———— PROGENY CLASSES CROSS- NON-CROSS- GRAND 
GENES REPULSION COUPLING OVERS OVERS TOTAL DEV./P.E. 
1 CaXp Cap Ca 99 109 208 5.0/4.8 
99 109 
II CaX Cu CaCu Ca Cu 0* CaCu Ca Cu 0 94 95 189 0.5/4.6 
12 20 28 34 19 21 27 28 
III Ca Xk Cak Cak 0 8 7 15 0.5/1.3 
7 B21 
IV Ca Xhr. Cahr Ca 32 26 58 3.0/2.5 
32 26 i 
IV CaXwo Cawowo 6 6 12 0/1.1 
6 6 
Ca X Cue2 CaCur Ca Cue 0 269 295 564 13.0/8.0 
111 1385 160 158 
Ca Xj Caj Caj 0 Caj Ca j O 54 48 102 3.0/3.4 
13 5 68 17 16 17 20 
CaXd CadCad0 17 23 40 3.0/2.1 
11 1766 
CaXA CaACaA 0 CaA Ca A 0 37 41 78 2.0/2.9 
13 4 113 14 10 1112 
Ca Xh Cah Cah 0 Cah Ca h O 21 13 34 4.0/1.9 
1 #236 3 3 115 


* Showing neither gene. 
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THE SUPPRESSION OF CROSSING OVER IN INVERSION 
HETEROZYGOTES OF DROSOPHILA PSEUDOOBSCURA 


By THEODOSIUS DOBZHANSKY AND CARL EPLING 
COLUMBIA UNIVERSITY, NEw YORK AND UNIVERSITY OF CALIFORNIA, Los ANGELES 


Communicated February 23, 1948 


Observations and experiments on natural and artificial populations of 
Drosophila pseudoobscura have shown that within a given population 
flies of which the third chromosomes differ in gene arrangement (inversion 
heterozygotes) often possess higher adaptive values than those of which 
the homologous pair has the same arrangement in both chromosomes 
(inversion homozygotes). * In many localities the inversion hetero- 
zygotes outnumber the homozygotes. The welfare of the populations 
of these localities, therefore, seems to depend upon the maintenance of a 
high adaptive level in the inversion heterozygotes relative to the homo- 
zygotes. In other localities, however, the populations consist mainly 
or entirely of inversion homozygotes. In these the adaptive level of the 
homozygotes must be at least tolerably high. 

In theory, two genetic mechanisms could bring about adaptive differences 
between and among the inversion homozygotes and heterozygotes. First, 
the action of genes in development may be altered simply by differences 
in their order within the chromosome (position effect). Second, chromo- 
somes with different gene arrangements may be essentially alike but may 
carry different complexes of genes which make their possessors physio- 
logically and adaptively different. At the same time they might be co- 
adapted in such a way as to produce the higher adaptive level of the 
inversion heterozygotes. Of these two mechanisms the first, position 
effect, may or may not play a réle in the evolution of the third-chromosome 
inversions of this fly. . However this may be, the second is both effective 
and important. For if the superior adaptiveness of inversion heterozygotes 
were due to position effect alone, then each arrangement would be expected 
to have the same relative superiority regardless of the geographic origin 
of the chromosomes concerned. ‘This is not the case. The evidence shows 
that inversion heterozygotes are adaptively superior to the corresponding 
homozygotes only if the chromosome pair has been drawn from the same 
or neighboring localities. They are no longér superior when each member 
of the pair is drawn from distant populations (as for example, from southern 
California and the central Sierra Nevada). The gene contents of the 
chromosomes from these populations are not co-adapted to each other 
in such a way as to produce adaptively superior heterozygotes when 
combined.* 

The biological function of inversions in the natural populations of 
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Drosophila and their principal réle in the evolution of these insects might 
be conceived to be the suppression of crossing over between gene complexes 
which have reached an adaptive equilibrium. Crossing over would 
destroy these complexes and would result in gene combinations of different 
adaptive values, but would be prevented by the binding effect of the 
inversions. Any inversion which is intrinsically neutral, that is, which 
produces no position effects, would accordingly spread through a population 
provided that it would guard from disintegration a gene complex well 
adapted to the habitat concerned. This would be particularly important 
where the highest adaptive values are found in heterozygotes. The degree 
to which such gene complexes are maintained by the binding effect of 
inversions would accordingly depend upon the degree to which crossing 
over is suppressed. The present study was designed to secure this in- 
formation. 


TABLE 1 


RECOMBINATION OBSERVED IN THE Cross or Sc pr cv 9 X or Sc pra o& 
WILD 


STANDARD ARROWHEAD CHIRICAHUA TREE LINE 
{wild type 2035 4473 3748 2708 
\or pr cv 769 1251 873 558 
for 1781 175 14 3 
1 \ pr oo 738 62 1 2 
for pr 357 2 
2 \ cv 425 7 
for cv 260 a 
12 | by 311 4 
Total 6676 5074 4636 3271 


Three wild strains of Drosophila pseudoobscura carrying Standard, 
Arrowhead and Chiricahua gene arrangements, respectively, and a strain 
carrying the Tree Line arrangement were employed. The former were 
originally collected at Pifion Flat, San Jacinto Mountains, California, 
the latter was collected at Mather, California, in the Sierra Nevada. The 
inversion heterozygotes of both localities are known to be adaptively 
superior to the homozygotes. The third chromosomes bearing the gene 
arrangements designated have been described previously.‘ Flies of each 
strain were outcrossed to a strain homozygous for the third chromosome 
recessives orange (or), purple (pr), crossveinless (cv) and the dominant 
Scute (Sc). The or Sc pr cv chromosome is known to have the Standard 
arrangement. / females resulting. from this cross, were then testcrossed 
to or Sc pr cv males. The progeny counts of the testcrosses are summarized 
in table 1. The gene Sc was disregarded in the counts because its mani- 
festation in heterozygous condition is variable in different strains.° 
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The observed percentages of recombinations are as follows: 


INTERVAL STANDARD ARROWHEAD CHIRICAHUA TREE LINE 
or-pr 46.3 4.0 0.3 0.15 
pr-cv 20.3 . 0.2 0 0 


Because the distance in the chromosome between the genes or and pr 
is so great that considerable double crossing over must take place in the 
flies with Standard chromosomes, another experiment was arranged. 
The four wild type strains were crossed to flies which carried a third 
chromosome with the recessives or and pr and the dominants Blade (BI) 
and Sc. F, females showing B/ and Sc were selected from the progeny 
and testcrossed to homozygous or pr males. The progeny counts of this 
testcross are shown in table 2. 


TABLE 2 


RECOMBINATION OBSERVED IN THE Cross or Bl Sc pr 2 X or pr S&F 


WILD 
STANDARD ARROWHEAD CHIRICAHUA TREE LINE 
fwild type 1933 3555 7483 2218 
\or BI Sc pr 1147 1954 3046 725 
1 for 313 99 27 7 
\BI Sc pr 220 . 54° 21 
for Bl 562 17 1 
\Se pr 408 9 2 
3 for Bl Sc 892 6 
\pr 922 11 
fBl 3 a 
Lf \or Sc pr 6 
23 {Bl Sc 81 F 5 
>" \or pr ‘1 25 1 2 
{Se 84 
we \or Bl pr 105 
- Total 6764 5706 10582 2950 


The following percentages of recombination have been computed from 
the data in table 2. 


INTERVAL STANDARD ARROWHEAD CHIRICAHUA TREE LINE 
or-Bl 10.5 2.7 0.5 0.2 
Bl-Sc 17.3 0.5 0.03 0 
Se-pr 32.1 0.3 0.02 0 


The standard map distance obtained in our experiments between the 
loci furthest apart, or and cv, is 80.2 units, or slightly higher than the 
figure 68.4 units, obtained by Tan* from somewhat less extensive data. 
It may also be noted that Tan’s experiments. were conducted at 25°C., 
ours at 18°C. The location in the cytologically visible chromosome of 
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the genes used in our experiments can be inferred from the chromosome 
map published by Tan.’ As seen in the salivary gland cells, the third 
chromosome is a ribbon-like structure which for purposes of description 
has been subdivided into nineteen more or less equally long sections 
numbered 63 to 81, inclusively. The gene or lies in section 65 at a distance 
from the centromere of about one-tenth the length of the whole chromo- 
somes. . The centromere is in section 63. The gene cy lies somewhere 
in sections 79-81, close to the free end of the chromosome. The map 
distance of 80 units between or and cv therefore represents most of the 
genetic length of the third chromosome. Taking into qccount the fact 
* that some double crossing-over has remained undetected, and also that 
between one-tenth and one-fifth of the cytological chromosome contains 
no known genetic markers, the linkage map of this chromosome is probably 
less than 100 units long. 

The total frequency of recombination found in Standard/Arrowhead 
heterozygotes is about 4.0 per cent, in Standard/Chiricahua 0.5 per cent 
and in Standard/Tree Line only 0.2 per cent. Arrowhead differs from 
Standard by a single inversion which includes sections 70-76 or about 
one-third the length of the chromosome.‘ Tan’ places the gene Sc within, 
and pr outside and distal to the Arrowhead inversion, but his data regarding 
pr are inconclusive. Bl and or certainly lie between the inversion and the 
centromere. In Standard/Arrowhead heterozygotes less than one-fiftieth 
of the recombination normally taking place in the pr-cv interval is per- 
mitted, and about one-fifteenth of recombination between or and pr, most 
of it in the or-B/ interval. In other words, an inversion exerts a relatively 
greater suppressive effect on recombination in the part of the chromosome 
which lies distal to it (between the inversion and the free end of the chromo- 
some) than it does on recombination between the centromere and the 
inversion. This agrees with the data of Sturtevant* for Drosophila melano- 
gaster. 

The Chiricahua and Tree Line arrangements differ from the Standard 
each by a triple inversion which extends from about the middle of section 
68 to section 79, inclusive, or about six-tenths of the length of the chromo- 
some as seen in the salivary gland cells.‘ Our data show that recombina- 
tion in Chiricahua/Standard and Tree Line/Standard is almost wholly 
suppressed in the whole of the third chromosome. The small amount 
still permitted, less than six-tenths per cent, is concentrated between the 
genes or and Bi, a negligible amount being found elsewhere. Particularly 
noteworthy is the fact that crossing-over is very strongly suppressed 
between the centromere and the inversions, the result, probably, of inter- 
ference by the inversions with meiotic pairing.® No recombination 
whatsoever is detectable in the inverted part of the chromosome. (The 
three or pr flies recorded in table 2 among Standard/Chiricahua and 
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Standard/Tree Line heterozygotes are most likely the result of contami- 
nation; otherwise, they must represent triple crossovers, one between 
or and the inversion and a double to include the locus of pr; the individuals 
concerned were not tested further.) 

The amount of crossing over within the inverted parts of the chromosome 
cannot be determined from our data, since single crossing-over in para- 
centric inversions does not result in detectable recombination.!° However, 
single crossing-over in multiple inversions may result in deficiencies and 
duplications which preduce unviable zygotes. The Standard/Tree Line 
heterozygotes, for example, have a long paired region which extends from 
section 68 to 74 (see Fig. 3 in Dobzhansky‘). Crossing-over in this region 
would result in chromosomes which would act as lethals in zygotes. 
Whether such crossing over actually takes place is unknown; in neither 
case would viable chromosomes carrying gene recombinations normally 
borne in Standard and in Tree Line (or Chiricahua) appear. The possi- 
bility cannot be excluded that in inversion heterozygotes chiasmata may 
be localized in the immediate vicinity of the centromere, but crossing-over 
at the centromere would give no recombinations of the genes in Standard, 
Tree Line and Chiricahua chromosomes. 

Summary.—Heterozygosis for inversions found in the third chromosome 
in natural populations of Drosophila pseudoobscura reduces the frequency 
of recombination of genes located in the chromosome to a small fraction of 
the normal value. Recombination is strongly prevented not only for 
genes within the inverted sections, but also for those which lie between 
the centromere arti] the inversion, and between the inversion and the 
free end of the chromosome. Inversion is therefore a powerful means of 
holding together gene combinations which confer upon their carriers 
superior adaptive properties. 


1 Dobzhansky, Th., Evolution, 1, 1-16 (1947). 

2 Dobzhansky, Th., and Levene, H. Unpublished data. 

3 Unpublished data. 

4 Dobzhansky, Th., in Dobzhansky and EpKing, Carnegie Inst. Washington, Publ. 
554, 47-144 (1944). 

5 Helfer, R. G., Genetics, 24, 278-301 (1939). 

6 Tan, C. C., Ibid., 21, 796-807 (1936). 

7 Tan, C. C., Z. Zellforsch. u. mikroscop. Anat., 26, 439-461 (1937). 

8 Sturtevant, A. H., Carnegie Inst. Washington, Publ. 421, 1-27 (1931). 

9 Dobzhansky, Th., Am. Nat., 65, 214-232 (1931). 

10 Sturtevant, A. H., and Beadle, G. W., Genetics, 21, 554-604 (1936). 
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THE ROLE OF MUTATION AND OF SELECTION IN THE FRE- 
QUENCY OF MUTANTS AMONG MICROORGANISMS GROWN 
ON IRRADIATED SUBSTRATE 


By WILSON S. STONE, FELIX Haas, J. BENNETT CLARK, AND ORVILLE WySS 


LABORATORIES OF GENETICS AND BACTERIOLOGY, THE UNIVERSITY OF TEXAS, AUSTIN, 
TEXAS 


Communicated by J. T. Patterson, February 20, 1948 


The mutation rate in microérganisms is increased by irradiation or 
chemical treatment of the substrate.!:? The problem of differentiating 
between the selection of spontaneous mutants and the induction of addi- 
tional mutants is difficult, especially as the mutation rates are low and the 
growth rate of bacteria on treated substrate may be slower than normal. 
If selection is to explain the increase in frequency of mutants found in 
irradiated broth over that found in normal broth, it must increase the 
reproductive rate of mutants or inhibit it less than it inhibits the growth of 
normal cells. The increased number of mutants found in irradiated broth 
must be explained by induced mutation if selection is neutral or against 
the mutants. 

Experimental.—The procedures for irradiating the medium with ultra- 
violet light and determining the drug resistant mutants are essentially 
those employed in our earlier reports." In addition to the stock strain 
of Staphylococcus aureus (F.D.A. #209) the first experiments reported 
here involved a penicillin resistant strain and a stréptomycin resistant 
strain which arose as mutants when the stock strain was grown in irradiated 
broth. Identical inocula from 4-hour log phase cultures and also from 
24-hour cultures (older cultures do not give consistent results) of each 
strain were planted in plain nutrient broth (control) and in irradiated 
nutrient broth. Immediately after inoculation and again after five hours, 
samples from each bottle were withdrawn for plating to determine the. 
total count and the number of drug-resistant individuals in each population. 

In table 1 are given the number of mutants per million-organisms which 
are resistant to the indicated concentrations of the antibiotics, except for 
the figures in italics which show the total plate counts. The age of the 
inocula appeared to have little effect on the behavior of the organisms and 
the two portions of the table may be regarded as replicate experiments. 
Since these cultures produce a maximum of about 300 million cells per ml. 
under the conditions employed, it is evident that all cultures were still 
in the rapid growth phase at the 5-hour plating. The amount of radiation 
to which the broth was exposed was such that in no case did the culture 
growing in the irradiated broth fall behind the replicate growing in the 
control broth by as much as one division during the 5-hour growth period. 
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TABLE 1 


EFFECT OF IRRADIATION OF THE BROTH ON THE GROWTH AND MUTATION RATE OF 
NORMAL AND DRUG-RESISTANT S. aureus 








——————24-HR. INOCULUM 4-HR, INOCULUM 
CONTROL * IRRADIATED CONTROL IRRADIATED 
5 0 5 0 5 0 5 HRs. 


Normal Strain 


Total count, 2.4 75 1:8: *32 0.86 65 1.1 47 
millions* 

Penicillin, 13 19 17 344 7.0 4.6 9.3 40 
0.07 u/ml. 

Streptomycin, 65 41 79 310 48 23 46 185 
3.0 u./ml. 

Streptomycin, 42 12 36 123 10 1 2.8 40 
6.0 u./ml. 

Penicillin Resistant Strain 

Total count, 1.5 60 LiF 41 0.73 80 Lucas: G6 
millions* 

Penicillin, 18T 999T ift 850T 14:65T -1.8f 6T 14T 
0.07 u./ml. 

Streptomycin, 49 40 40 215 53 10 53 196 
3.0 u./ml. 

Streptomycin, 19 32 12 98 5.5 6.3 3:5: 28 
6.0 u./ml. 

Streptomycin Resistant Strain 

Total count, 0.78 68 0.82 53 £.a° OF 1.6 60 
mullions* 

Penicillin, 5.1 6.5 16 70 9.2 3.9 21 85 
0.07 u./ml. 


Streptomycin, 500T 280T 330T 226T 302T 78T 260T 180T 


3.0 u./ml. 
Streptomycin, 94T 66T 73T 12.8T 80 30 180 775 


6.0 u./ml. 


* Figures in italics are total count in millions determined by plating on nutrient agar. 
Other figures are the number of mutants (per million of the total count) which grew on 
the indicated antibiotic concentration: (T = thousand) 


Yet during this period the proportion of mutants in the normal population 
grown in the irradiated broth increased many-fold. 

It is clear that the penicillin resistant mutant resembles the normal 
strain in its rate of mutation to streptomycin resistance under the several 
test conditions. Likewise, the normal and streptomycin resistant strains 
have similar mutation rates to penicillin resistance. These rates, measured 
by rate of occurrence of organisms which grow at the indicated antibiotic 
concentrations, vary considerably but give a fairly consistent pattern. 
There is an increase in the number of resistant cells after growth on irra- 
diated broth. However, when the changes in penicillin resistance in the 
penicillin resistant strain and of streptomycin resistance in the streptomycin. 
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resistant strain under the several test conditions are compared, there is 
no consistent relation. In only a few instances did all the cells in the 
so-called resistant strains actually grow in the concentration of the anti- 
biotic employed. Sometimes more appeared under control conditions, 
sometimes more from the irradiated broth. This is not in harmony with 
the idea that irradiated broth either conditions or selects for resistant 
organisms. . 

Selection could account for the increase in numbers of mutants in the 
irradiated broth only if the normal bacteria were slowed down in repro- 
ductive rate or killed, while the multiplication of the resistant mutants was 
speeded up or at least not retarded to the same extent. The number of 
times that the populations increased during the 5 hours was computed 
from the data on table 1. These figures for the total population and for 
those fractions which grew in the presence of 0.07 unit per ml. penicillin 
or 3 units per ml. streptomycin are recorded in table 2. 


TABLE 2 
INCREASE IN 5 Hours OF THE ToTaL CELL COUNT AND OF THE MUTANTS EXPRESSED 
AS MULTIPLES OF THOSE PRESENT IN THE INOCULUM 


————24-HR. INOCULUM 4-HR. INOCULUM 
CONTROL IRRADIATED RATIO CONTROL IRRADIATED RATIO 








Normal Strain 


Total count* 31(5.0)  28(4.8) 0.9 76(6.2) 43(5.4) 0.6 
Penicillin mutants 45 (5.5) 575(9.2) 13 60 (5.7) 185(7.6) 3.7 
Streptomycin mu- 20 (4.2) 110(6.8) 5.5 36 (5.2) 175(7.4). 5 
tants (3.0) 
Penicillin Resistant Strain 
Total count 40 (5.4) 24 (4.6) 0.6 110 (6.8) 49 (5.6) 0.4 
Streptomycin mu- 33.(5.1) 128(7.0) 4 21 (4.4) 181(7.5) 8.6 


tants (3.0) 
Streptomycin Resistant Strain 
Total count 87 (6.5) 65 (6.0) 0.7 37 (5.2) 37 (5.2) 1 
Penicillin mutants 112 (6.8) 285(8.2) 2.5 16 (4.0) 150(7.2) 9.4 


* Figures in parentheses give number of bacterial generaticns required to produce 
5-hour population from inoculum, 


Growth of the normal and the streptomycin resistant strains in irradiated 
broth was only slightly less than in control broth but the penicillin mutant 
was inhibited to a greater extent. Both the penicillin resistant mutant 
and the streptomycin resistant mutant were originally isolated from 
irradiated broth. If selection is to account for the increase in number of 
organisms resistant to antibiotics in irradiated broth, these mutants should 
have a positive selection value in irradiated broth. This is not the case. 
Therefore, selection fails to explain the increase in ‘numbers of organisms 
resistant to antibiotics in the irradiated broth. 
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Table 2 shows the differential increase in resistant organisms ih irra- 
diated broth during the five-hour growth period. This is expressed both 
as multiplication of the original cells (normal and resistant) during that 

TABLE 3 


GROWTH AND MUTATIONS IN A MIXED CULTURE OF NORMAL AND MUTANT CELLS 








Gide I Tos, IRRADIATED———~ 
MUTANTS PER TOTAL IN MUTANTS PER TOTAL IN 
HRS. MILLION MILLIONS MILLION MILLIONS 


Normal Strain 











Penicillin, 0.07 u./ml. 0 20 1.3 16.7 0.7 
2 15.8 38 372 7.8 
5 9.8 102 316 38 
Penicillin, 0.20 u./ml. 0 0 1.3 0 0.7 
2 0 38 8.6 7.8 
5 0.4 102 4.7 38 
Penicillin Resistant Strain 
Penicillin, 0.07 u./ml. 0 826T 0.58 1000T 0.5 
2 1000T 16.3 992T 1.4 
5 895T 67 1000T 13 
Penicillin, 0.20 u./ml. 0 0 0.58 0 0.5 
2 0 16.3 203 1.4 
5 6.0 ae 238 13 
Mixed Culture (0.9 Normal + 0.1 Resistant) 
Penicillin, 0.07 u./ml. 0 58T | 48T 1.2 
; 2 41T 40 60T 5.3 
5 42T 96 58T 72 
TABLE 4 
GrowTH AND MUTATIONS IN A MIxED CULTURE OF NORMAL AND MUTANT CELLS 
—CONTROL TIRRADIATED-——— 
MUTANTS PER TOTAL IN MUTANTS PER TOTAL IN 
HRS. MILLION MILLIONS MILLION MILLIONS 
Normal Strain 
Streptomycin, 3.0 u./ml. 0 25.3 A: ee 29.8 0.8 
3 22.2 35 210 5.3 
Streptomycin, 10 u./ml. 0 0 0.8 0 0.8 
3 0.6 35 5.1 5.3 
Streptomycin Resistant Strain 
Streptomycin, 3.0 u./ml. 0 925T 0.8 965T 0.6 
3 385T 19.4 429T 3.1 
Streptomycin, 10 u./ml. 0 210 0.8 165 0.6 
3 66.8 19.4 "985 3.1 
Mixed Culture (0.9 Normal + 0.1 Resistant) 
Streptomycin, 3.0 u./ml. 0 106T ice 124T 1.0 
3 56T 36.0 104T 7:2 


period and as cell divisions necessary to give the net increase in numbers. 
The increase in number of resistant cells is of a consistently different order 
of magnitude from the differential increase in number of cells present in 
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itradiated and control media. Furthermore, the resistant mutants are 
similar to the normal strain in giving a differential increase in num- 
ber of mutants resistant to the second antibiotic. These differences 
are expected if some substance in the irradiated broth causes an increase 
in the mutation rate. They are not in agreement with an explanation of 
the increase of resistant organisms based on selection. 

Sitice it may be suggested that population dynamics in mixed cultures 
sometimes gives an unexpected advantage to one component, artificial 
mixtures of the normal and mutant organisms were prepared. The 
inoculum of the mixed cultures, which consisted of 9 times as many organ- 
isms from the normal strain as from the mutant strain, was planted in both 
irradiated and plain broth as were the requisite pure culture controls. At 
the time of inoculation and again 2 and 5 hours, or 3 hours later, the total 
count and the numbers of the mutants were determined (tables 3 and 4). 
It is evident that the mutants enjoyed no selective advantage in the 
irradiated broth. However, there was a definite and greater increase in 
the resistant strain of the mutants which grew in the higher concentration 
of antibiotics. 

Another line of evidence is derived from experiments in which organisms 
were left in contact with the irradiated broth for varying periods of time 
and then washed and removed to normal broth to eliminate the possible 
selective action of inhibitory principles derived from the radiations. The 
procedure which included controls to test the effect of the washing and 
transferring process is described below and is outlined in the footnote 
of table 5. 

Bottles of sterile unirradiated and irradiated nutrient broth (I A and 
II A) were inoculated with about 2 X 10° organisms per ml. from a log 
phase culture of S. aureus. After one-half hour a 10-ml. sample was 
withdrawn aseptically from each inoculated broth and placed in a sterile 
centrifuge tube; the bottles were incubated at 37°C. The samples were 
centrifuged and the sedimented organisms were washed with unirradiated 
broth and recentrifuged. The washed organisms were inoculated into 
bottles of unirradiated broth (I B and II B) from which samples were 
immediately withdrawn and plated in appropriate dilutions to determine 
the total numbers and the incidence of mutants. Further platings were 
made after 2- and 5-hour incubation at 37°C. 

When the cultures in bottles I A and II A had been incubated 1!/2 
hours, another 10-ml. sample was withdrawn from each and the centrifuging 
and washing process performed as above. The washed organisms from 
the 1'/.-hour cultures were inoculated into bottles of unirradiated broth 
(I C and II C) from which platings were made as before at 0, 2 and 5 hours 
after inoculation. In a like manner, a third sample was withdrawn from 
bottles I A and II A after they had been incubated 4 hours, but due to 
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TABLE 5 


EFFECT OF THE TIME OF CONTACT WITH IRRADIATED BROTH ON SUBSEQUENT GROWTH 
AND MUTATIONS IN UNIRRADIATED BROTH 


HOURS AFTER INOCULATION 
9 
4 











BOTTLE aaa b c a b c a b ¢c 
Experiment 1 
IA 2 6 32 ne Rear sei 65 88 177 
TILA 3 4 22 s eae ba 83 iJ 11 
1B 0.6 16 48 6 65 158 83 46 57 
IIB 0.7 6 25 10 6 13 79 6 9 
LG 0.8 69 159 24 50 115 63 103 374 
Li ¢ 5.8 +t il 32 3 21 84 8 17 
ID 0.8 100 186 12 82 262 63 35 145 
II D 1.0 7 30 8 ‘4 38 51 6 16 
Experiment 2 
IA 2 17 43 “ aes wat 55 300 102 
IIA 2 12 26 * sie Aap 96 1 8 
IB 0.3 39 48 15 46 31 47 110 58 
IIB 0.4 12 31 19 8 6 55 2 8 
peo 0.6 59 26 14 19 57 42 255 472 
sc 2 15 20 33 1 6 87 4 14 
ID ‘| 78 152 23 148 257 
IID 2 2 11 26 6 14 


a = total count per ml. in millions. 6b = mutants per million on 0.07 unit per ml. 
penicillin. c = mutants per million on 3 units per ml. streptomycin. 

IA. Bottle of irradiated broth inoculated with log phase culture of S. aureus. 

IB. Bottle of unirradiated broth inoculated with washed centrifuged cells which 
had grown in I A for 1/2 hour. 

IC. As IB but inoculum grown in I A for 1/2 hours. 

ID. As IB but inoculum grown in I A for 4 hours. 

II A, B,C, D. Same as the I series except that II A was unirradiated broth control. 


growth of the culture the washed sedimented cells were diluted before 
inoculating into the bottles of unirradiated broth (I D and II D) so that 
the inoculum was about 10° cells perml. The total count and the incidence 
of mutants in bottles I A and II A were determined immediately after 
inoculation and again after 5 hours’ incubation. 

The results of replicate experiments are given in table 5. 

It should be emphasized that the value of these tests lies in the fact that 
bacteria were exposed first to irradiated broth and then transferred to 
normal broth. The table shows that the technique 6f centrifuging the 
bacteria did not modify the results appreciably, for the two controls are 
essentially alike. Further, the same increase in number of mutants present 
is apparent in bacteria grown in irradiated broth whether or not they were 
centrifuged and then grown in unirradiated broth. Also this table shows 
that if a ‘‘high’’ percentage of mutants were present when the bacteria were 
transferred into normal broth (i.e., after four hours in irradiated broth) 
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this frequency was retained as the bacteria grew in normal broth. Further, 
if the bacteria remained in the irradiated broth for only a short period 
(1'/2 hours or sometimes even '/2 hour), the number of mutants present 
after growth in normal media increased quite markedly. The irradiated 
media had affected the bacteria so that the mutation rate, measured in 
their descendants grown in normal broth, was markedly higher after several 
cell divisions. Demerec* observed a similar effect from direct irradiation. 
These results are in agreement with the hypothesis that some material in 
the media, when utilized in the cell, causes mutations which can show in 
the descendants of the cell. The increase in number of mutants after 
transfer into normal broth is not in agreement with an explanation based 
on selection for resistant organisms by irradiated broth. 


TABLE 6 


EFFECT OF IRRADIATED BROTH ON THE OCCURRENCE OF BIOCHEMICAL MUTANTS 
From mannitol + to — 








CONTROL IRRADIATED———— 
NO. CELLS ° NO. CELLS 
STRAIN OF NO. PER NO. PER 
S. aureus TESTED MUTANTS MUTANT TESTED MUTANTS MUTANT 


Rosenbach 4,000 0 af 349 349 
801 1 801 328 es 

3,976 2 1988 1,064 : 355 

ses #3 ee 3,477 386 

6,816 3408 10,974 f 731 

465 Mi 4,350 4350 


Subtotal 16,058 30,542 1073 
Strain A 1,800 2,160 720 
Strain B 4,600 7,000 1400 
Strain C 2,100 v 4,320 1080 


Subtotal 8,500 8500 13,480 
TOTAL 24,558 4093 44,022 
From mannitol — to + 

1,403 0 3,666 
542 0 6,750 
7,448 0 4,000 

3,666 0 sme 

15,660 0 


TOTAL 28719 0 14,416 22 655 


The third series of experiments concern a different property of the 
organism, the ability to utilize mannitol. The normal strain of S. aureus 
used can ferment mannitol; it produces mutants which cannot. These 
mannitol negative organisms can in turn give rise to mannitol positive 
strains, presumably by reverse mutation. Table 6 shows that both the 
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direct and reverse mutation rate are increased by treatment with irra- 
diated broth. The several different strains tested acted essentially alike. 
Grown ‘in normal media, the combined mutation rate from mannitol + 
to — is one mutation in 4093; on irradiated media one in 1074; or roughly 
four times as great. No reverse mutation from — to + was discovered in 
28,719 tested organisms grown on unirradiated broth, but 22 were found 
in 14,416 (1 in 655) among organisms grown in irradiated broth. The 
tables show the fluctuations in frequency of mutants in the several samples 
as would be expected from a mutation phenomenon. Although it was 
not proved that these — to + were reverse mutations, this theory is con- 
sistent with the situation in higher organisms. Whatever may be the truth 
of that matter, it is exceedingly difficult to explain how irradiated broth 
could select both for and against the ability to ferment mannitol. 

Summary.—lIf selection is to explain the increased occurrence of mutants 
when S. aureus is grown in irradiated broth, the following conditions should 
be met: (1) Quantitative experiments should show that the mutants have 
a selective advantage over the normal strain under these conditions. This 
should be especially evident in a mixed culture. (2) When organisms 
are centrifuged from irradiated broth after a short exposure period and in- 
oculated into unirradiated broth, the increase in the number of mutants 
should cease. (3) Both forward and reverse mutations should not be 
differentially increased by growth in irradiated broth. The results of our 
experiments do not fulfill any of these conditions. The results are in agree- 
ment with the hypothesis that mutations are induced by some factor in 
the irradiated broth. 


1Stone, W. S., Wyss, O., and Haas, F., these PROCEEDINGS, 33, 59-166 (1947). 
2 Wyss, O., Stone, W. S., and Clark, J. B., J. Bact., 54, 767-772 (1947). 
3 Demerec, M., these PROCEEDINGS, 32, 36-46 (1946). 


ON THE AVERAGE VALUE OF ARITHMETIC FUNCTIONS 
By RICHARD BELLMAN 
DEPARTMENT OF MATHEMATICS, PRINCETON UNIVERSITY 


Communicated by S. Lefschetz, February 13, 1948 


The purpose of this note is to introduce a new method of estimating 
sums of the form 


me ys yl em)» (1) 


where f(m) is an arithmetic function such as the Euler totient function, 
¢(n), or the Dirichlet divisor function, d(n), and g(m) is either a polynomial 
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in m, a linear term of the form ap, + 5b, where p, is the mth prime, or an 
exponential term of the form ak” + b. 

We shall sketch the method in outline and then give the essential par- 
ticulars for the sum 


S= L o(Px(n)), (2) 


1snswN 


where 
o(n) = > d, (3) 


and P;(n) is a polynomial of degree k with integer coefficients. A general 
investigation of results of this type will be presented subsequently. 

The illustrious Indian mathematician, Ramanujan, was the first to appre- 
ciate the importance of the function 

Cy(n) oy exp(2x ipn/q), 1< pq, (4) 
’ q -_ 

in the analytic theory of numbers. He gave mamy interesting expansions 
involving these functions, of which the following are a sample: 


a(n) cs ae 
n 6 g 


a(n) = YX a(n) log 9/g 


Den) /¢ (5) 


The formulas seem to constitute an important link between the additive 
and multiplicative properties of numbers. There are several ways of 
deriving these striking formulas following methods of Hardy, Estermann, 
or Ramanujan himself. The author has derived another method which 
was used to state sufficient conditions for the expansion of a function in 
such a series. These results have been applied to estimation of the gener- 
alized Ingham sums of the form,' 


LL, fimfln + 8), (6) 


Returning to (1), we see that if 


ro} 


f(n) = 2d, agc,(n), (7) 


then . 


Efe = Ea} 


N 
S zu cain) (8) 


n= 


1 
ro N 

bis >. Ag tees { > mpoieany 
q=1 (¢,¢) = 1 = 











VoL. 34, 1948 MATHEMATICS: R. BELLMAN 151 


Sums of the form 


N 
E(N, g,q) = ody, exp(2mipg(n)/q) (9) 


have been the subject of considerable research in connection with Waring’s 
problem. They are also of great intrinsic interest. The following result, 
due to Hua,? is fundamental. 
k 
Lemma 1. Let g(n) = >> agx*, (ao, ai, ..., x) = 1. Then 


l=0 


q 


| ‘ . 

EWN, gq) -~ ¥ ‘exp( 2778) | =o¢-"**), (10) 
dn=1 qd 

where the constant implied by 0 is independent of N, p, and q, depending only 

on k and «. 

The estimate is most useful for g << N. Hence, the sum for f(g(m)) is 
broken into two parts, (1, VN), and(N +1, ©). Thesum from NV + 1 to 
© is estimated trivially, if \a,| is small enough asg— ©. For the case 
of d(n) where the series is no longer absolutely convergent, difficulties arise. 

We now use the above a earnest to prove 


Tueorem. Let g(n) = >> ax’, (ao, a1, ...,a%) = 1. ThenasN- 
=o 





N k+l © q 
D, oe) ~ oS = ey = igh es exp(2xipe(n)/a) (11) 


The case where (do, a1, ..., @) > 1 may be treated similarly, requiring 
only slight attention to details. 
Proof: We treat }>,0(g(m))/g(m) first and then use partial summation. 


o(g(n)) _ 2? XY e(g(n)) , mS calg(n)) 
ge) Gers @ + Owes se 
= P(n) + R(n). 
Let us estimate R(m). It is known, elementarily, that 
c(n) = >) du(n/8) (13) 
5\(q, m) 

Hence 
lcelm)| Se 8 = olan) S @ m)d(@ n)) (14) 

since 
o(n) = Yibsn 1K nd(n). (15) 


b\n b\n 
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Thus 
IR] < FD @ a(m))d((g, e(m))/a" (16) 
IN+1 
< 4 5 9 
Cine ek, NS ee, A ae tae am, 
6 alg) (a,g@)) = 0 

— ll d(a) ( 1/,2| 
~ 6 olgn) CG ([N+ 1/6] +2 <q <o@ j 


Considering separately the two cases o > ~/N, o < WN, we see that 
|R| is less than 


cod*(g(n)) 
/N 


since d(nm) = O(n‘) for any e> 0. Thus it is clear that 


= o(1), (17) 


N 
2d, R(n) = o(N). (18) 


n= 
N 


* Using Hua’s theorem, Lemma 1, in connection with >> P(n); and then 
n=1 


summation by parts, we obtain the result stated in Theorem 1. 

To obtain results for primes, we require estimates for trigonometric 
sums involving primes. For exponential terms, results concerning primi- 
tive roots are needed. 


1 Bellman, R., ‘(On Ingham Sums and Ramanujan Expansions,’’ unpublished. 
2 Hua, L. K., ‘On an Exponential Sum,”’ J. Chin. Math. Soc., 2, 301-312 (1940). 


AN INVERSION AND REPRESENTATION THEORY FOR CON- 
VOLUTION TRANSFORMS WITH TOTALLY POSITIVE KERNELS 


By I. I. HIRSCHMAN, JR., AND D. V. WIDDER 
DEPARTMENT OF MATHEMATICS, HARVARD UNIVERSITY 


Communicated by Hassler Whitney, February 7, 1948 


Let b, {a:}f be real constants subject to the sole restriction that 


to ) 


> (1/ax)? < ©. It is easily verified that the formula, 


1 





Gh=-= i : r d 
ae SE cs cd s/ap S; 
1 


a 
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defines G(t) as a real-valued function of ¢ for (—~7 <t< ©), and that 
G(t)eC*. The problem of inverting the convolution transform with 
G(t) as kernel, ‘ 


f(x) = S% G(x — Held, (2) 


has been considered by D. V. Widder.! In a note? in these PROCEEDINGS 
he announced that if ¢(¢) « Z and if x is a point of continuity of ¢(¢) then 


fim P,(D)f(x) = (x), (3) 
where 


n D ; 
P,(D) = e®-*)? TT (1 ee =) ee /4k 
=1 


the b, being any real constants such that lim 6, = 0. Here the symbol 
n> © 
D represents the operation of differentiation and e”/* the operation of 
translation through a distance 1/a. In preparing the detailed proof of 
this result the authors discovered it could be much improved. It is the 
purpose of the present note to state these more general results. We have 
succeeded in removing all restrictions from ¢(t) except those required to 
make the integral (2) converge. We have also been able to treat the 


Stieltjes case, 
f(x) = JX G(x — te*da(s), (4) 


where ¢ is an arbitrary real constant, which is inverted, as was to be ex- 
pected, by 
lim Jf, e~“P,(D)f(x)dx = a(x2) — a(x). (5) 
n> © 
I. J. Schoenberg’ has studied the functions G(¢) in considerable detail, 
and some few of our results, although obtained by different methods, over- 
lap with his. When this is the case the priority is his. The functions 
G(t) defined in (1) are examples of what he has called ‘‘totally positive” 
functions. They do not constitute all such functions, but they virtually 
do so, and we have therefore borrowed his suggestive terminology. 
A kernel G(t) belongs to Class I if there are both positive and negative 


a,’s; G(t) belongs to Class II if there are only positive a,’s and if })(1/ax) = 
1 


©; G(t) belongs to Class III if there are only positive a,’s and if }) (1/a,) < 
1 


o, Either G(t).or G(—?#) belongs to one of these classes. 
We shall briefly indicate the new properties of G(#) which we have 
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established. Theorem 1 may be obtained as a special case of more general 


results due to Schoenberg. 
THEOREM 1. Jf G(t) ¢s defined as in equation (1), then G(t) is a frequency 


function with mean b and variance >°(1/a,)?; moreover, G(t) is bell shaped, 
1 


4.e., (d/dt)"G(t) has precisely n changes of sign. 

THEOREM 2. Jf Gi(t) and G2(t) are defined as in equation (1) with con- 
stants b, {a;},° and c, {di}1° respectively, and if the {d;};° area subset of the 
{a;}1” , then G,(t) — G2(t) has at most two changes of sign. 

THEOREM 3. If G(t) is defined by equation (1) and if G(t)e Class II then 
for L = L(t) defined as a function of t re the equation, 


a ES 5, ay (ay en L)”’ 
we have for all t sufficiently large, t > T, 


5 log G(t) = L(t + 6 + o(1)), 
where o(1) represents a function of t which approaches zero as t becomes 
positively infinite. 

Using these theorems and other results familiar in specific cases from the 
theory of the Laplace‘ and Stieltjes® transforms we have been able to prove 

THEOREM 4. Jf G(t)e Class I, then if the transform (4) converges for any 
one value of x it converges for all values of x. Moreover, if x; and x2 are 
points of continuity of a(t), then the inversion formula (5) is valid. If 
c > Min [a,], then a(+) exists and x2 may be takenas +o. Ifc & 

ap>0 

Max [a,], then a(— ©) exists and x; may be taken as — ©. 
ap <0 

TuHeoreM 5. If G(t)e Class II, then if the transform (4) converges for any 
one value of x, it converges for all larger values of x so that tts region of con- 
vergence is of the form y.< x < ©. [The transform may or may not con- 
verge at the abscissa of convergence y-.| Moreover, if x; and x2 are points of 
continuity of a(t), then the inversion formula (5) is valid. If c 2 Min [ax], 
k = 1,2, ..., then a(+ ©) exists and x. may be taken as + ©. 

THEOREM 6. Jf G(t)e Class III, if a(t) is of bounded variation in every 
closed finite subinterval of T< t < ~, and tf the transform (4) converges for 


some one value of x > T + 6 + >¢(1/a,), then it converges for all such x. 
1 


Moreover, if x; and x2 are points of continuity of a(t), then the inversion 
formula (5) is valid. If c 2 Min [a,),k = 1, 2, ..., then a(+ ©) exists 
and x. may be taken as +. . 

Analogous theorems may be stated for the convolution transform (2), 
which is inverted by the operator (3) for every point of continuity x of ¢. 
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We may further improve these results. Let us list some conditions on 
the a,’s and },’s. 


) ad 3/2 
A. |a,|-* = 0 | >» a] ’ 


ae i n+l 


© V/s 
B. bn = of D> ao] ’ 


n+i1 


co 1/3 
C. bh, = o| : a| . 
s+ 1 
If a(t) is supposed to be normalized, a(#) = 2-'[a(t+) + a(t—)], and if 
conditions A and B are satisfied, then (5) holds even for points of dis- 
continuity. If condition C is satisfied, then (3) holds for all ¢ in the Le- 
besgue set of ¢(/) and hence almost everywhere. 

These results include as special cases, and in a few instances go beyond, 
the results obtained for the Laplace and Stieltjes transforms, . their 
iterates,» 7 the Meijer transform,’ etc. They are substantially best 
possible results. 

The authors have further constructed a comprehensive representation 
theory. Such theories have been constructed for the Laplace and Stieltjes 
transforms by D. V. Widder,” ® for the iterated Stieltjes transform by 
R. P. Boas, Jr., and D. V. Widder® and for the third iterate of the Laplace 
kernel by E. J. Akutowicz." Dr. Boas® has further announced but not 
published a similar theory associated with the Meijer transform. The 
present theory includes all these results as special cases. Since the state- 
ments of the theorems are rather long, and since they can be readily sur- 
mised by anyone familiar with the foregoing particular theories, we ‘shall 
state only one typical result. 


TuHeEoreEM 7. Jf G(t)e Class II then the conditions 


qT. f(x)e i (ye < x< o), 
2. f(x) =0 ("*) («> &), 


where a. = Min [a] ,k = 1, 2, bh 





3. PAD)f(x) 20 (ve -b+bn- Y Va<x< @)n =1,2,. 
1 
are necessary and sufficient that 
f(x) = S22 G(x — tda(t), 


with a(t) non-decreasing and with abscissa of convergence less than or equal 
lo Yc. 


1 Widder, D. V., Duke Math. J., 14, 217-251 (1947). 
* Widder, D. V., Proc. Nat. Acap. Sct., 33, 295-297 (1947). 








156 MATHEMATICS: G. W. MACKEY Proc. N. A. S. 


3 Schoenberg, I. J., Jbid., 33, 11-17 (1947). 

4 Widder, D. V., Trans. Am. Math. Soc., 36, 107-200 (1934). 

8 Widder, D. V., Ibid., 43, 7-60 (1938). 

6 Pollard, H., Duke Math. J., 14, 129-142 (1947). 

7 Pollard, H., Jbid., 14, 659-675 (1947). 

8 Boas, R. P., Jr., Proc. Nat. Acap. Scr., 28, 21-24 (1942). 

9 Boas, R. P., Jr., and Widder, D. V., Trans. Am. Math. Soc., 45, 1-72 (1939). 

10 Akutowicz, E. J., ‘‘The Third Iterate of the Laplace Transform,’’ Thesis, Harvard 
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THE LAPLACE TRANSFORM FOR LOCALLY COMPACT ABELIAN 
GROUPS 


By GEORGE W. MACKEY 
DEPARTMENT OF MATHEMATICS, HARVARD UNIVERSITY 


Communicated by Oscar Zariski, February 13, 1948 


Introduction.—As has been shown by A. Weil! and others the classical 
theory of the Fourier transform for functions defined on the real line has 
a natural generalization to a theory of a transform taking functions defined 
on a locally compact Abelian group G into functions defined on the charac- 
ter group G of G. This theory includes as special cases the theory of 
n-tuple Fourier series and the theory of m dimensional Fourier transforms. 
It is based upon the’ very simple idea of replacing the function exp (ity) 
which occurs in the classical one-dimensional Fourier transform by the 
character function (t, y) whose value for each ¢ in G and each y in G is the 
value at ¢ of the character y; and thus defining the Fourier transform of a 
suitably restricted f(t) as /f(t)(t, y)dt where integration is with respect 
to the Haar measure in G. 

It is the purpose of the present note to describe how this idea may be 
further exploited so as to generalize the Laplace transform and the Laurent 
series in an analogous fashion and thus to connect certain aspects of 
complex variable analysis with the theory of topological groups in the same 
way that this is done for real variable analysis by the generalized Fourier 
transform. It is hoped that ultimately such considerations will throw 
light on the theory of functions of several complex variables. 

At the present time we are in a position only to describe the basic notions 
and state a few preliminary results. We expect to publish later in another 
journal a paper giving detailed proofs of the theorems stated here as well 
as those of others which we hope to obtain. 

Real Characters, Linear Functionals and One-Parameter Subgroups.— 
We shall call a continuous complex valued function z, defined on the 
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locally compact Abelian group G, a generalized character if 2(tt2) = 
2(t)2(te) for allt; and fin G. It is evident that every generalized character 
has a unique representation as the product of two generalized characters, 
one of which takes its valiies on the positive real axis and the other on the 
unit circle. The generalized characters of the latter sort are clearly identi- 
cal with the characters of the Pontrjagin-van Kampen duality theory and 
will be referred to as ordinary characters. Generalized characters of the 
former sort will be called real characters. If r is a real character then the 
function x obtained by setting x(t) = log (r(¢)) has the property that 
x(tite2) = x{t,) + x(b) for all 4, tin G. A continuous real valued function 
with this property will be called a linear functional. Clearly the above- 
described mapping sets up a one-to-one correspondence between the real 
characters and the linear functionals on G. Now let x be any linear func- 
tional on G. For each real number u the function taking ¢ in G into the 
complex number exp (iux(t)) is clearly an ordinary character y(u). More- 
over y(u% + ue) = y(u1)y(u2) and y is a continuous function of u. Thus 
y(u) is a continuous homomorphism of the additive group of the real line 
into G. Such a homomorphism is called a one-parameter subgroup of G. 
It may be proved without great difficulty that every one-parameter 
subgroup of G may be obtained from a-linear functional in the manner 
just described and that there is thus obtained a one-to-one correspondence 
between linear functionals on G and one-parameter subgroups in G. We 
shall find it convenient to use the same’ symbol to denote a linear functional 
and the corresponding one-parameter subgroup and to call the compound 
object so denoted a vector. If x is a vector, ¢ a member of G and wu a real 
number, then x(#) will denote the value at ¢ of the linear functional asso- 
ciated with x and x[u] will denote the member of G assigned to u by the 
one-parameter subgroup associated with x. We convert the set of vectors 
into a real vector space by carrying over the obvious vector space operations 
in the set of linear functionals. We note that (x; + x2)[u] = x[u]xe[u] 
and u,x|u]=x[uu] whenever x, x, and x2 are vectors and u, mu and wu are 
real numbers. The natural question concerning the existence of vectors 
is answered by the 

THEOREM. Let G be a locally compact Abelian group and let G be its 
character group. Then the following statements about G and Gare equivalent, 

(a) For each ¢ in G distinct from the identity e there exists a linear 
functional x with x(t) ¥ 0. 

(6) For each ¢ in G distinct from e there exists a real character r with 
r(t) ~ 1. 

(c) The union of the images of all one-parameter subgroups of G is 
dense in G. 


(d) G is connected. 
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(e) G is the direct product of a discrete torsion free group and the 
additive group of an m dimensional vector space. 

The Laplace Transform.—We define the generalized Laplace transform 
by replacing the ordinary characters of the Fourier transform theory by 
the generalized characters described above. The Laplace transform of a 
suitably restricted function? f defined on G is then /f(t)(t, z)dt where 
(t, 2) is the value at ¢ of the generalized character z. In view of the re- 
marks of the preceding section we may write this as /f(t) exp (x(t))(t, 
y)dt where y is an ordinary character and x is a vector. The result may 
be regarded as a function defined in the direct product of the space G of 
all vectors with G. If G is taken to be the additive group of integers then 
G is the multiplicative group of complex numbers of modulus one and 
G may be put in to one-to-one correspondence with the positive real 
numbers in such a way that the positive real number r corresponds to the 
linear functional which carries n into n log (7) and hence to the real charac- 
ter which carries n intor". Thus the Laplace transform in this case may be 
regarded as mapping doubly infinite sequences {a,},n = +1, +2,..., into 
functions defined in the complex plane with the origin removed. It is 
readily seen to coincide with the corresponding mapping defined by the 
Laurent series. If G is taken to be the additive group of real numbers, 
G X G turns out to be isomorphic to the additive group of all complex 
numbers and the transform considered here reduces to the classical doubly 
infinite Laplace transform. Taking finite direct products of these groups 
with themselves we obtain multiple Laplace transforms and Laurent series 
in several complex variables. It follows from the theorem of the pre- 
ceding section that there is little if any point in considering groups for 
which G is not connected. With this in mind we may say that roughly 
speaking the generalized Laplace transform defined above is the “direct 
product”’ of the classical m-fold doubly infinite Laplace transform and a 
transform which generalizes a multiple Laurent series to the same extent 
that a general discrete torsion free group generalizes a direct product of 
finitely many replicas of the additive group of the integers. 

There are many questions that may be asked about the nature of the 
subset of G X G in which £ f(t) exp (x(t))(t, y)dt is defined. The answers 
to these depend, of course, upon the sense in which the integral is to exist. 
We shall remark here only that if the integral is required to be absolutely 
convergent then the region of existence is the direct product of a convex 
subset of G with the whole of é. This, of course, implies at once the well- 
known facts about the regions of absolute convergence’ of Laurent series 
and doubly infinite Laplace transforms. 

Analyticity of the Laplace Transform.—Partial differentiation of functions 
defined on a topological group may be defined by means of one-parameter 
subgroups. If f is a function defined on G and x is any member of G the 
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partial derivative f, of f with respect to x may be defined as the limit as 
u tends to zero of (1/u)(f(yx[u]) — f(y)) provided that this limit exists in 
some appropriate sense. If f is the Fourier transform of a suitably re- 
stricted function k on G then f, will be the Fourier transform of the function 
gon G such that g(t) = x(#)R(t) for all ¢ in G. 

Since G is a real vector space it is clear that when it is provided with 
any reasonable topology there will be a natural one-to-one correspondence 
between its elements and its one-parameter subgroups and, accordingly, 
such a correspondence between the elements of G X G and the one-param- 
eter subgroups of G X G. Thus if (x, x2) is a member of G X G and 
F a suitably restricted function defined in G X G we may speak of the 
partial derivative F(,,, ,,) of F with respect to (*, x2). It is clear that 
if u is any real number then the partial derivative of F with respect to 
u(x, X2) = (ux, UxX2) is equal to the product of u and the partial derivative 
of F with respect to (x1, x2). Now GXG may be converted into a complex 
vector space by defining (uw + iv)(x, x2) to be the element (wx, — v%x2, 
vx, + UX) but it will, of course, not always be true that the above homo- 
geneity relationship will continue to hold for complex multipliers. Given 
any class of differentiable functions those members F for which F(,,, ;,) 
is a complex homogeneous* function of {x;, x2) will be referred to as the 
analytic functions in that class. 

As is to be expected there are theorems connecting the properties of being 
analytic and of being a Laplace transform. We shall close this section by 
describing such a theorem. Let f be any function in Z? on G. Then, 
as may be proved without difficulty, the set of all x in G for which f(#) exp 
(x(¢)) is in L? on G is a convex subset K, of G and contains the zero element. 
We shall call a convex subset of G large if it contains the zero element 
and the smallest linear subspace of G containing it is the whole of G. 
We shall call a point x in a large convex subset of G an interior point if, 
for some real number u greater than one, the point ux is also in the convex 
subset. A function f for which K;, is large will be said to be strongly in 
L?. If f is strongly in L? then the function on K, X G which for each x 
in K, is defined for almost all y in G as the Fourier transform of f()exp(x(t)) 
we shall call the L? Laplace transform of f. Now let K be any large convex 
subset of G and let F be a function defined on K X G so as to be in L? on 
G for each x in K. Then for each x in the interior of K the expression 
(1/u)(F(x + ux, yx2[u]) — F(x, y)) may for each (x1, x2) be regarded as a one- 
parameter family of elements of L? on G defined for u sufficiently small. 
If this family tends to a limit as u approaches zero for all x interior to K 
and all (x1, x2) in G X G and if this limit F(z, ») is complex homogeneous 
as a function of (x1, x2) we shall say that F is L? analytic interior to K X G. 
These definitions having been made, our theorem may be stated as 
follows. 
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TuroreM. Let F be defined on K X G where K is a large convex subset 
of G. Then F is L? analytic interior to K X G if and only if there exists a 
function f strongly in L? on G such that K, contains the interior of K and F 
coincides* interior to K X G with the L* Laplace transform of f. If f does 
exist it is unique (modulo functions zero almost everywhere) and F(x, 2) 
is also analytic and is the L? Laplace transform of the function g such that 
g(t) = f(t) + txt). 

Complex Integration and the Cauchy Formula.—If F is the Laplace 
transform of a function f on G then for each x in G it is the Fourier transform 
of f(t) exp (x(t)). Thus (formally) (F(x, y)/(t, y))dy = f® exp (x() 
so that (F(x, y)/(exp (x(t))(t, y)))dy = f(t). In other words, for suitably 
restricted functions, f may be recovered from F by integrating F(x, y)/ 
(exp (x(t))(t, y)) over any subset of G X G which lies in the region of 
definition of F and is of the form x X G; the measure being the Haar 
measure in G transferred by the one-to-one mapping y — %o, y. It follows 
that F is determined by its values on any subset of the form x» X G and 
we are led to expect a formula giving F(x, y) in terms of F(x, y). If we 
follow through the formal work of finding f from F(x, y) and F(x, y) from 
f and replacing the Fourier transform of a product by the convolution of 
the Fourier transforms we get the formula F(x, y) = J F(x, y1)K(x — xo, 
yyi—')dy, where K (x, y) is the Fourier transform of exp (x(#)). Un- 
fortunately, this formalism cannot be made to lead to a theorem since the 
Fourier transform of exp (x(t)) does not exist. Suppose, however, that 
¢ is the characteristic function of a set so small that ¢(¢) exp (x(t)) does 
have a Fourier transform for all x in a suitably large subset of G. Then 
if f vanishes outside of this set so that f = $f the procedure described 
above leads to the formula F(x, y) = SF(xo, v)Ko(x — x, yi ')dy 
where K,(x, y) is the Fourier transform of; ¢(¢) exp (x(t)). This formula 
may be proved rigorously for suitably restricted functions F. In the case 
in which G is the additive group of the integers and ¢ is the characteristic 
function of the non-negative integers it reduces to the classical Cauchy 
integral formula for the special case in which the closed curve is a circle 
with center at the origin. 

If G is finite dimensional (and probably with even a weaker restriction) 
it is possible to extend the above considerations to the case in which the 
integration is performed over a much more general subset of G X G than 
one of the form x X G. Let y be a suitably differentiable function de- 
fined throughout G and having values in G. Let C, be the set of all points 
of G X G of the form ¥(y), y for yeG. Then a complex valued measure 
My may be introduced into C, which is of such a nature that theorems 
can be proved establishing the formulae: 


ft) = Sey (Fe, y)/(exp (x(t))(t, y)))duy 
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and 
F(x, y) = Sey F(%, )K(x — x, yy" )dpy 


where, in the second integration, x and y are regarded as fixed. As the 
work necessary to remove obviously too restrictive conditions from the 
statements of such theorems has not yet been done we shall confine our- 
selves here to a description of the measure y,. Let @ be the function taking 
y in G into yy), y in G XG. Then @ is a one-to-one map of G onto Cy 
and hence wy is completely determined by the measure in G defined by the 
equation fy(Z) = y,(0(Z)). When y is sufficiently differentiable by will 
be absolutely continuous with respect to the Haar measure in G and hence 
will be completely described by its Radon Nikodym derivative with re- 
spect to this Haar measure. This derivative may be computed from 
y as follows. For each x in G the partial derivative y, of y with respect 
to x may be defined in a manner strictly analogous to that used for complex 
valued functions. The result will be linear in x and will be for each y 
in Gamember of G. Thus for each y in G, y,(y) defines a linear trans- 
formation T,(y) of the vector space G into itself. It may be extended to 
a complex linear transformation of G X G into itself in a unique and obvious 
fashion. The sought for Radon Nikodym derivative 7 is defined by the 
equation: r(y) = (1/i) determinant (iJ + _T,(y)). Here J represents 
the identity transformation and n is the dimension of G. Presumably a 
measure with similar properties may be introduced into C, when y is only 
assumed to have some property analogous to being of bounded variation. 
How this is to be done is a matter which we expect to investigate. 

Concluding Remarks.—Among other results in our possession at the 
moment is a generalization to functions on G X G of Hadamard’s three 
circles theorem. Its proof may be carried out using the same device used 
by Zygmund and Tamarkin® in proving Thorin’s generalization of the M. 
Riesz convexity theorem. This is not surprising once one has observed 
that Thorin’s theorem is essentially the Hadamard three circles theorem 
for functions of m complex variables. We have also investigated the 
extension to groups of the Hilbert transform and have used this extension 
to obtain formulae giving F(x, y) in terms of its values on a set of the form 
x0 X G which are different from those discussed in the preceding section. 
These and other matters we intend to discuss in full in the more detailed 
paper promised in the introduction. 

1 Weil, A., ‘“L’intégration dans les groupes topologiques et ses applications,” Actualités 
Scientifiques et Industrielles, No. 869, Paris, 1940. 

2 Functions mentioned in this note are complex valued except when the contrary is 
explicitly assumed. 

’ This particular way of formulating analyticity was suggested to the author by the 


treatment of complex manifolds in an unpublished manuscript of C. Chevalley. 
* Coincidence here means, of course, having the same value for almost all y for each x. 
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5 Cf. Bochner, S., “Group Invariance of Cauchy’s Formula in Several Variables,”’ 
Ann. Math., 45, 686 (1944). 

6 Tamarkin, J. D., and Zygmund A., ‘‘Proof of a Theorem of Thorin, Bull. Am. Math. 
Soc., 50, 279 (1944). 





ON THE ASYMPTOTIC DISTRIBUTION OF THE SUM OF A 
RANDOM NUMBER OF RANDOM VARIABLES 


By HERBERT ROBBINS 
DEPARTMENT OF MATHEMATICAL STATISTICS, UNIVERSITY OF NORTH CAROLINA 
Communicated by Marston Morse, January 9, 1948 


We shall state without proof some results on the asymptotic distribution 
as \ — © of the sum 


Y= X,+...+Xy 


of a random number of independent random variables, where the X, have 
the same fixed distribution function F(x) = P[X, < x] and where N is 
a non-negative integer-valued random variable independent of the X,, 
whose distribution function depends on a parameter }. We shall use 
the notation 
a = E(X)), ce? = Var(x;(0< c? < &), 

a = E(N), y? = Var(N)(0 & y2< &), M = (N-a)/7, 

6(t) = E(e*™), oc = ac’? + ya’, 6 = (ya)/o, 

Z = (Y — E(Y))/WVar(Y) = (Y — aa)/o, — gt) = E(e#). 





THEOREM 1. Jf as\ 7 @ 
oe 5 emer 0(o°), (1) 


then for any t,as\ > @ 


g(t) = (the 77°°-™ + (1). (2) 
Corotiary 1. Jf (1) holds andifas\— & 
a*y’? = o(a), (3) 
then for any t, 
lim g(t) = e~”™, (4) 
A> @ 


so that Y is asymptotically normal. 
Coro.iary 2. If (1) holds and if M has a non-normal limiting distribu- 


tion function G(x), so that 
lim 0(t) = g(t) # e~/™, (5) 
\—> © 
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and if 
2 
lim “— = s(0 ¢ s< &), (6) 
A wo "A 


then 





t 
Vil+s 
so that Z has a non-normal limiting distribution function. 

It is easy to show that if M has a limiting distribution function G(x) 
such that G(x) >0 for every x, then as\ > ©, a— @ and y = O(a), 
so that (1) holds. 

Corotiary 3. Jf N is asymptotically normal then Y is asymptotically 
normal. 

An example in which (1) does not hold is the following: for any positive 
integer \ let N have the values \, 2\ with P[V = A] = P[N = 2A] = 1/2, 
and leta = 0. Then o? = (3dc?)/2, y = \/2, y + o(0?). Here we may 
show directly that 


tim (0) = ef Jenene = 


lim y(t) = %/2{e77*" + e~"}, 
»\—> © 


which is the characteristic function of a mixture of two different normal 
distributions. This is a special case of the following theorem. 
THEOREM 2. If 


a = 0, lim y/a = r(0<r <?o), (8) 


A> © 


and if M has a limiting distribution function G(x) (necessarily such that 
G(x) = O for some x), then 


lim y(t) = f."e~”*"*dGi(x), (9) 
\-—> @& 
where 


Gis)'='G =) (10) 


It follows that Z has the limiting distribution function 


. x 
H(x) . I, Hy (5. ic (11) 


Ho(x) = ze Zz du (12) 


is the normal distribution function with means 0 and variance 1, 
A full account of these results will be published elsewhere. 


where 
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ON VARIATION-DIMINISHING INTEGRAL OPERATORS OF 
THE CONVOLUTION TYPE 


By I. J. SCHOENBERG 


DEPARTMENT OF MATHEMATICS, UNIVERSITY OF PENNSYLVANIA 


Communicated by Oswald Veblen, February 20, 1944 





A real matrix A = || aux i (=J,...,n;k = 1, ..., m), is said to be 
totally positive if all its minors, of any order, are non-negative. The 
algebraic properties of totally positive matrices have been intensively 
studied by F. Gantmakher and M. Krein in 1937.1 In 1930 the author? 
showed that if A is totally positive, then the linear transformation, 


™m 


vu = p> GQgx, (t = 1, ...,), (1) 
=1 
is variation-diminishing in the sense that if v(x,) denotes the number of 
variations of sign in the sequence {x,} and v(y,) the corresponding number 
in the sequence {y,}, then we always have the inequality 


v(94) S v(x). (2) 


In a previous note*® the author has defined the class P of the so-called 
Polya frequency functions as the class of functions A(x), -7 <x < o, 
satisfying the following three characteristic conditions: 

1. A(x) is measurable. 

2. lfa<xe<...< x, andh<th< ... < ty, then the determinant of 
order n, whose element in the 7th row and jth column is A(x; — #,), should 
be non-negative, i.e., 


det 


IV 
S 








This inequality should be verified for m = 1, 2, 3, .... 
3. Finally 


0< f-% A(x)dx << +o, 


All Polya frequency functions A(x) turn out to be everywhere continuous 
with the single exception of the so-called truncated exponential 


e-* ifx 2 0 
siniills of if x < 0, 


and all functions arising from it by a change of scale and origin. As stated 
in the same note, a function A(x) is equivalent (or equal almost everywhere) 
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to a Polya frequency function if and only if it has a bilateral Laplace 
transform of the form 


© 1 ; 
y e~ *A(x)dx = Vs) ,(-—a< Rs<a,forsomea>0O), (4) 
where W(s) is an entire function of the type II of Laguerre, Polya and 
Schur‘ 


V(s) = Ce~ Vt TT (1 + &s)e™, (4’) 
vy=1 


(where C >0, y 2 0, 6, 6, real, O< y? + 26,?< o—), the relation (4) holding 
in a certain open vertical strip of the s-plane containing the imaginary axis. 
Accordingly 


A(x) = e~*’, e~*!, e&-e*, 1/cosh x, 


are a few examples of Polya frequency functions because their bilateral 
Laplace transforms are readily found to be of the form (4), (4’). 

The analogy between the property (3) of a Polya frequency function 
and the definition of a totally positive matrix A, as given in the first 
paragraph, would suggest that the convolution transformation, 


g(x) = JUA(x — Af (Ode, (5) 


should be variation-diminishing, provided A(x)eP. For this purpose we 
require the following two definitions. 

I. Let f(x) be a real function defined for all real x. The number o(f), 
of variations of sign of f(x) in the range (— ©, ~), is defined as follows: 
If S: x1 < x. < ... < x, is an arbitrary finite increasing sequence of real 
numbers, then 


nf) = sup v(fx)), OSvf)S ). 


II. Let A(x) be real and measurable for all real x. We say that the 
transformation (5) is variation-diminishing if and only if A(x) enjoys the 
following two properties: 

i. 

0< f.% |A(x)|dx < @. 

2. If f(t) is bounded and measurable, then (5) should always imply the 

inequality 


o(g) S v(f). (6) 


The following theorem holds. 
THEOREM. The convolution transformation 
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g(x) = fi A(x — df(Odt (5) 


is variation-diminishing if and only if the function A(x) admits of a bilateral 
Laplace transform of the form 


F e **A(x)dx = aan (7) 


where ¢€ = +1 and ¥(s) is an entire function of the Laguerre-Polya-Schur 
type (4’), the relation (7) being valid in a certain open vertical. strip of the 
s-plane containing the imaginary axis. An equivalent formulation 1s as 
follows: (5) is variation-diminishing tf and only if eA(x)(e = +1 or else = 
—1) is almost everywhere equal to a Polya frequency function: 

If (5) is variation-diminishing, we always have v(g) S v(f), even if f(t) is 
not bounded, provided f(t) is summable in every finite range and such that the 
improper integral (5) converges for every value of x. In particular, this ts 
always the case if f(t) is a polynomial. 

The variation-diminishing property of the classical transformation, 


ge) = i © ee fod, 


has previously been established by Polya® in connection with related 
work of Laguerre. 


In the case of the ‘‘rectangular’’ frequency function 


_ §1/(2h) if -hoxsh 
etl { Oifx >h,orx< —h, (8) 
(5) turns into the familiar “integral-mean’’ transformation 
1 
g(x) = 5 Sai” fat (9) 


which is often used for practical or theoretical smoothing purposes, a fact 
which may justify the following remarks. The transformation (9) is not 
variation-diminishing, because (8) is not a Polya frequency function. 
Indeed, the transform of (8) is 

1 ee sinh(sh) 

2h uae’ . sali sh 
which is certainly not of the necessary form (7) because it has (infinitely 
many) zeros. An example when (9) increases the number v(f) is easily 
constructed as follows. Let m be an integer, m 2 3, and let 


fiz) = 1+ cosx ifx< -—rorx >f, 
~ Yl+cosx — (n+ 3/.) if -—rSxS zm. 
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Evidently v(f) = 2. However, with 2h = (2n + 1)z, we find that (9) 
transforms our f(x) into g(x), where 


(—1)) 


g(x) = 7 ~ cos * in the range —(n —'/2)aS x (m — ¥/s)m. 





Thus v(g) = 2(m — 1) > o(f). 
The term ‘‘Polya frequeney functions” for the functions of our class P 
seems justified for the following reasons. Let 


1 mg 
HW) * yo (oy ws’, (wo ¥ 0), (10) 
be the power series expansion of the meromorphic function (4). Evidently 
S% A(x)x’dx, (vv =0,1,2,...). (11) 


If f(t), of (5), is a polynomial of degree n, then (5) may be evaluated as 
follows: 


g(x) = f 8 A(t) f(x — t)dt = fe _A® E ak fp wat 


or 


1 1 
g(e) = fle) — Gms) +. * Fi mF MO) = GHIO)* (2) 


In this connection Polya proved in 1915 the following remarkable result’: 
“Let (10) be a formal power series (formal means that no convergence is 
assumed) with the following property (S): If f(x) is an arbitrary real 
polynomial, then the number of real zeros of the transformed polynomial 
g(x), given by (12), should never exceed:the number of real zeros of the given 
polynomial f(x). A power series (10) enjoys the property (S) if and only 
if it is the Taylor expansion of the reciprocal +1/W(s) of an entire function 
.W(s) of the form (4’).” Thus our Theorem was discovered by Polya in 
1915 for the case when f(x) and g(x) are polynomials,* the convolution 
transformation (5) being replaced by the differential operator relation (12). 
These remarks show that the Polya frequency functions A(x) appear im- 
plicitly in Polya’s work of 1915 in terms of their moments (11). 

In concluding we wish to mention a result of Th. Motzkin which is 
used in the proof of our Theorem. In 1930 the author showed that the 
linear transformation (1) is certainly variation-diminishing if the matrix 
A does not possess two minors of equal orders and of opposite signs. He 

_also showed that conversely, if (1) is variation-diminishing, then A cannot 
have two minors of equal orders and of opposite signs, provided the rank of 
Ais =m. The necessary and sufficient conditions in order that (1) be 
variation-diminishing were discovered in 1933 by Th. Motzkin.® They 
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are as follows: Let r be the rank of A; then A should not have two minors 
of equal orders and of opposite signs if their common order is <7, while 
if their common order is =r then again they should never be of opposite 
signs if they belong to the same combination of r columns of A. 

In his contribution to the Courant Anniversary Volume” the author 
has recently discussed the discrete analog of the problem of the present 
note, namely, the variation-diminishing (sequence) convolution trans- 
formations 


oe ee a eee ns ot eee (13) 


y= — @ 


in their relation to the sequences {L,,} satisfying the following two condi- 
tions: (1) The 4-way infinite matrix |) has only non-negative minors; 








(2)0< >) L, < o. These sequences are the discrete analogs of the 


Polya frequency functions. 
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if n — +o; (2) forevery bounded sequence tech. (13) implies the inequality v(y) S v(x). 
The first condition is unnecessarily restrictive; indeed, it is readily seen that Theorem 2 
(loc. cit., page 363) and its proof remain valid if we define a variation-diminishing trans- 
formation (13) by the following two conditions: (1’) 0 < =|L,| < o, L, > 0 for some 
n; (2) v(y) S v(x) for bounded {x,}. 


PHYSICAL CURVES IN GENERALIZED FIELDS OF FORCE* 
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1. The differential geometry of the dynamical trajectories and certain 
important physical systems of curves of general positional fields of force 
has been developed by Kasner in his Princeton Colloquium Lectures.! 
Recently we have introduced the concept of generalized fields of force 
which depend not only on the position of the point but also upon the 
direction through the point. Positional fields may be described as iso- 
tropic and generalized fields as antsotropic.? 

If (¢, ¥) are the rectangular components of a generalized field of force 
in the plane, the corresponding equations of motion of a particle are 


9° 


de? 





(90), 2 = vey, (1) 
The cartesian codrdinates of a point are denoted by (x, y), the slope throuth 
the point by » = dy/dx, and the time by ¢. We assume that the slope of 
the force- vector is not identical with p. For otherwise, the trajectories 
are all straight lines and the field of force is of no interest. Ina generalized 
field of force, there are ~* trajectories, just as in the positional case. 
Upon eliminating the time ¢ from the differential equations (1) of motion 
of a particle, we find that the differential equation of third order defining a 
eect eter) family of ©* curves 1 


da 2 
wo — po) 5% = te + Bids — 40) — p45] 2 + ts — po» — 361(2Y. 
(2) 


2. A system S, of curves in a generalized field of force consists of curves 
along which a constrained motion is possible so that the pressure P is 
proportional to the normal component N of the force vector. Thus 
P=kN. There are ~* curves in a given system S;, for a fixed k. 

The differential equation of third order defining any system 5S, is 











arenas: 
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's ad ips pe 4 ee a 
(y eis. pd) dx? eat ly. + PY rz) p dy] dx? i [vo Pop 3¢ 


= DO+ W) (HV 





1 + p? dx? 
The relation between k and n is 
2 
a 4 
Rim Oy x (4) 
The system S; is undefined fork = —lorn = o., 


The important systems of physical interest are: 


The system Sp of trajectories defined by k = 0 or n = 2. 

The system S, of general catenaries defined by k = 1 orn = 1. 

The system S_, of general brachistochrones defined by k= —2 or n= —2. 
The system S.. of velocity curves defined by k = © orn = 0. 


3. Every system S,; of ~* curves is of the type (G), that is, it is defined 
by a differential equation of third order of the form 


d’y 2 


oF ay (2 
peas G(x, y, p) dx? + (x, y; p) dx? } 


dx’ 


(5) 


Conversely we have proved that except for the velocity systems S., any 
family of ~* curves of the type (G) always represents a system S, in a general- 
ized field of force. There are ~* such fields of force. 

On the other hand, we have shown that a family of curves of the type 
(G) is a velocity system S., if and only if it is a system of curvature trajectories. 

Thus the differential equation of third order defining any velocity system 
S. is : 

3 2. 2. 2 
OY = (as + pay) $2 + a» (52): 6) 
This system of + curves consists of the curvature trajectories of the 
family of ~? curves: d?y/dx? = e*” 

For the positional case, the functions G and H need to be specialized 
further. A characteristic set of five geometric properties for the system 
S; in positional fields of force have been obtained by Kasner.* 

4. Weshall define the angular rate \ which is associated with any lineal- 
element E of the plane for the motion of a particle when acted upon by a 
generalized field of force. As the lineal-element E rotates about its point 
P, the corresponding force vector F also rotates about P. The angular 
rate \ is the instantaneous rate of change of the inclination of F with 
respect to the inclination of E. It is given by the formula 
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_ (L+ £) (br — vr) 
e+e 
For any ordinary positional field of force, \ = 0. However, the angular 
rate \ vanishes not only for positional fields but also for those generalized 
fields of force such that the direction of the force vector at any point P 
does not vary but its magnitude does. 
The possible Jines of force are defined by 


r 





(7) 


v(x, y, p) ae po(x, y; p) = 0. (8) 
Along any line of force, the angular rate A is 
Vp — Pdp 
ja ae, (9) 
: C 


If there exists a lineal-element E, which satisfies (8) and if \ + 1 at Ey, 
then (8) can be solved for p in the neighborhood of Eo, that is, p = f (x, y). 
In general, this has ~' integral curves, namely, the lines of force. 

5. Ata point P, there is, in general, a single particular curve Cy of the 
system S, such that the particle describing this curve Cy has zero speed 
at P. The direction of Cy) at P coincides with that of the force vector. 
Conversely there are ©! curves of the system 5S, in the direction of the 
force vector. The particle describing each one of these curves except 
Cy have speeds different from zero at P; but then the curvatures are zero . 
at P. For the particular curve Cy, the speed of the particle at P is zero 
but its curvature at P is not zero in general. For the system Sp of tra- 
jectories, the curve Cp is the rest trajectory. In any system S,, there are 
©? such curves Co. 

The ratio p of the curvature of the particular curve Cy to the curvature of the 
tangent line of force, is 

ee (1+ k(1 —d) 
p= = , (10) 
l+n-d (8+) — (1+ BD, 

For the system Sp of trajectories, this is p = (1 — A)/(3 — A). 

For the system S; of catenaries, this is p = (1 — A)/(2 — A). 

For the system S_, of brachistochrones, this is p = (A — 1)/(A + 1). 

For the system S of velocity curves, this is p = 1. 

The above result generalizes various extensions of the theorem of Kasner 
concerning one-third of the curvatutes.‘* 

6. An arbitrary lineal-element transformation, not necessarily of the 
contact type, will send the ~* curves of every system S, (including k = @) 
into the ~* curves of a system S, if and only if it is an extended collineation 
or correlation, ; 

In the positional case, we have proved that the only transformations 
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which convert every system S, except k = 0 into a system S, are the 
similitudes. The only appropriate transformations for the systems Sy 
are the collineations (related to Appell’s transformation). 


* Presented to the American Mathematical Societies, 1948. 
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